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Rewriting Codes for Flash Memories 
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Abstract — Flash memory is a non-volatile computer mem- 
ory comprising blocks of cells, wherein each cell can take 
on q different values or levels. While increasing the cell level 
is easy, reducing the level of a cell can be accomplished only 
by erasing an entire block. Since block erasures are highly 
undesirable, coding schemes — known as floating codes (or 
flash codes) and buffer codes — have been designed in order 
to maximize the number of times that information stored in a 
flash memory can be written (and re-written) prior to incurring 
a block erasure. 

An (n,k,t)q flash code C is a coding scheme for storing 
k information bits in n cells in such a way that any se- 
quence of up to t writes can be accommodated without a 
block erasure. The total number of available level transitions 
in n cells is n(q— 1), and the write deficiency of C, defined 
as 5(C) = n(q—l) — t, is a measure of how close the code comes 
to perfectly utilizing all these transitions. In this paper, we show 
a construction of flash codes with write deficiency O(qklogk) if 
q ^ log 2 fc, and at most 0(k\og 2 k) otherwise. 

An (n,r,£,t)q buffer code is a coding scheme for storing 
a buffer of r £-ary symbols such that for any sequence of t 
symbols it is possible to successfully decode the last r symbols 
that were written. We improve upon a previous upper bound on 
the maximum number of writes t in the case where there is a 
single cell to store the buffer. Then, we show how to improve a 
construction by Jiang et al. that uses multiple cells, where n > 2r. 



Index Terms — Coding theory, flash memories, flash codes, 
buffer codes. 



I. Introduction 

Flash memories are, by far, the most important type of 
nonvolatile computer memory in use today. Flash devices 
are employed widely in mobile, embedded, and mass-storage 
applications, and the growth in this sector continues at a 
staggering pace. 

A flash memory consists of an array of floating-gate cells, 
organized into blocks (a typical block contains about 2 20 
cells). The level or "state" of a cell is a function of the 
amount of charge (electrons) trapped within it. In multilevel 
flash cells, voltage is quantized to q discrete threshold values; 
consequently the level of each cell can be modeled as an 
integer in the range 0, 1, ... , cj—1. Nowadays, the parameter 
q itself can range from q = 2 (the conventional two-state 
case) up to q = 16 and it can reach even higher values (6). 
The most conspicuous property of flash-storage technology 
is its inherent asymmetry between cell programming (charge 
placement) and cell erasing (charge removal). While adding 
charge to a single cell is a fast and simple operation, removing 
charge from a cell is very difficult. In fact, flash technology 
does not allow a single cell to be erased — rather, only 
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entire blocks can be erased. Such block erasures are not 
only time-consuming, but also degrade the physical quality 
of the memory. For example, a typical block in a multilevel 
flash memory can tolerate only about 10 4 or even fewer 
erasures before it becomes unusable, and as such the lifetime 
and performance of the memory is highly correlated with 
the frequency of block erasure operations. Therefore, it is 
of importance to design coding schemes that maximize the 
number of times information stored in a flash memory can be 
written (and re-written) prior to incurring a block erasure. 

Such coding schemes — known as floating codes (or flash 
codes) and buffer codes — were recently introduced in (JJ, 
0, |9). Since then, several more papers on this subject have 
appeared in the literature 0, HOj— H21, (BJ, Q9|. It should be 
pointed out that flash codes and buffer codes can be regarded 
as examples of memories with constrained source, which were 
described in ifPII . Yet another example of such codes are the 
write-once memory (WOM) codes 13, J4), ITF71 . that have 
been studied since the early 1980s. In fact, flash codes may be 
regarded as a generalization of WOM-codes. Slightly different 
and yet very related are the rank modulation codes fPHl . fl4l . 
In rank modulation, the information is not stored according to 
the exact cell levels but rather by the cell permutation which 
is derived from the ordering of these levels. 

An (n,k,t)q flash code C is a coding scheme for storing 
k information bits in n flash-memory cells, with q levels 
each, in such a way that any sequence of up to t writes 
can be accommodated without incurring a block erasure. In 
the literature on flash codes, a write is always a bit-write — 
that is, a change — » 1 or 1 — > in the value of one of 
the k information bits. Observe that in order to accommodate 
such a write, at least one of the n cells must transition 
from a lower level to a higher level (since a cell's level, 
determined by its charge, can only increase). On the other 
hand, the total number of available level transitions in n flash 
cells is n(q— 1). Thus, throughout this paper, we characterize 
the performance of a flash code C in terms of its write 
deficiency, defined as 6(C) = n(q—l) — t. According to the 
foregoing discussion, 6(C) is a measure of how close C comes 
to perfectly utilizing all the available cell-level transitions: 
exactly one per write. The primary goal in designing flash 
codes can thus be expressed as minimizing deficiency. 

What is the smallest possible write deficiency 
6q(n,k) for an (n,k,t)q flash code, and how does it 
behave asymptotically as the code parameters k and n get 
large? The best-known lower bound, due to Jiang, Bohossian, 
and Bruck (9), asserts that 



6 q (n,k) ^ -(q - l) min{n,fc-l} 



(1) 



How closely can this bound be approached by code construe- 
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tions? It appears that the answer to this question depends 
on the relationship between k and n. In this paper, we are 
concerned mainly with the case where both k and n are 
large, and n is much larger than k (in particular, n ft: 2 ). 
In SectionlVTi we consider the case where k/n is a constant. 
At the other end of the spectrum, the case k > n has been 
studied in fl2l . 

The first construction of flash codes for large k was reported 
by Jiang and Brack iflOl , ifTTl . In this construction, the k 
information bits are partitioned into m\=k/k' subsets of 
k' bits each (with k' ^ 6) while the memory cells are 
subdivided into mi ^ m\ groups of n' cells each. Additional 
memory cells (called index cells) are set aside to indicate 
for each subset of k' bits which group of n' memory cells 
is used to store them. The deficiency of the resulting flash 
codes is 0{^/qn). Note that for n ^ k, the lower bound on 
write deficiency in (Q~|i behaves as Cl(qk), and thus does not 
depend on n. Consequently, the gap between the Jiang-Bruck 
construction IfTTl and the lower bound could be arbitrarily 
large, especially when n is much larger than k. 

In lfl9l . a different construction of flash codes was proposed. 
These codes are based upon representing the n memory cells 
as a high-dimensional array, and achieve a write deficiency 
of 0(qk 2 ). Crucially, the deficiency of these codes does 
not depend on n. Nevertheless, there is still a significant 
gap between 0(qk 2 ) — which is the best currently known 
deficiency result — and the lower bound of Cl(qk). 

In this paper, we present a new construction of flash codes 
which reduces the gap between the upper and lower bounds on 
write deficiency to a factor that is logarithmic in the number 
of information bits k. This result is arrived at in several stages. 
As a starting point, we use the "indexed" flash codes of Jiang 
and Brack IfTTl . In Section llVl we develop new encoding and 
decoding procedures for such codes that eliminate the need for 
index cells in the Jiang-Bruck construction IfTTl . The write de- 
ficiency achieved thereby is 0(qk 2 ), which coincides with the 
main result of 1191 . When the encoding procedure developed in 
SectionlTVlreaches its limit, there are still potentially numerous 
unused cell-level transitions. In Section|V] we show how to 
take advantage of these transitions in order to accommodate 
even more writes. To this end, we introduce a new indexing 
scheme, which is invoked only after the encoding method of 
SectionllVl reaches its limit. Thereupon, we extend this idea 
recursively, through [log 2 fc] different indexing stages. This 
leads to a result, established in Theorem01 stating that 

0.(qk) s= 6 q (n,k) ^ 0(max{q,log 2 k} klogk) (2) 

for all n ^ k 2 , where the upper bound is achieved construc- 
tively by the flash codes described in SectionlVl In Section [VT] 
we present and discuss constructions of flash codes for the case 
where the number of memory cells n is not significantly larger 
than the number of bits k. 

The other type of codes we discuss in this paper are the 
buffer codes. An (n, r, £, t)q buffer code is a coding scheme for 
storing a buffer of r £-ary symbols such that for any sequence 
of t symbol writes, it is possible to successfully decode the 
last r symbols that were written without a block erasure. Given 



a buffer of r £-aiy symbols that has to be stored in n q-aiy 
cells, the goal is to maximize the number of writes f. 

In Section IVIII we formally define buffer codes. Then, we 
study two extreme cases where the number of cells is either 
one or very large. For the former case, Jiang et al. gave in (JJ, 
ifTOl a construction as well as an upper bound on the number 
of writes. Their construction works for n — 1,£ — 2 and 
+ r — 2 writes. The upper bound stated 
asserts that 



guarantees t 
in Q, OH for n 
- 1 
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r+ [((<?-!) mod (i r -l) + l)J. 



We will show how to improve this bound such that for q ^ 



q-l r 



7 £d 



r, 



where <p is Euler's cp function. 

If the buffer is binary (£ = 2) and the number of cells is 
significantly larger than the buffer size r, then a trivial upper 
bound on the number of writes t is n(q — 1). Jiang et al. 
showed in UJ, ifTOll how to achieve f = (q — l)(n — 2r + 
1) + r — 1 writes. Assume that q = 2, then the number of 
writes is n — r and after the z'-th write, the buffer is stored 
between cells i + 1 and i + r. If q > 2, then the cell levels are 
used layer by layer, where first only levels zero and one are 
used, then one and two, and so on. In the transition from one 
layer to another, first the buffer is copied and stored in the new 
layer and then more writes are allowed. Thus, this construction 
allows n — r writes on the first layer and n — 2r + 1 more 
writes in all other layers, so the total number of writes is t = 
n - r + (q - 2)(n - 2r + 1) = (q - l)(n - 2r + 1) + r - 1. 
We will show how to improve this construction such that in 
every transition between layers, the buffer is stored cyclically 
in the cells and thus is not copied as before. This improves 
the number of writes to (q — l)(n — r). 

II. Preliminaries and Flash Codes Definition 

Let us now give a precise definition of flash codes that 
were introduced in the previous section. We use {0, l}* to 
denote the set of binary vectors of length k, and refer to the 
elements of this set as information vectors. The set of possible 
levels for each cell is denoted by Aq = {0, 1, . . . , q—1} and 
thought of as a subset of the integers. The q" vectors of length 
n over Aq are called cell-state vectors. With this notation, 
any flash code C can be specified in terms of two functions: 
an encoding map £ and a decoding map T>. The decoding 
map T> : Aq — > {0,1}^ indicates for each cell-state vector 
x E Aq the corresponding information vector. In turn, the 
encoding map £ : {0, 1, . . . ,k— 1} xAi 1 — > Aq U {E} assigns 
to every index i and cell-state vector x E Aq, another cell- 
state vector y = £{i,x) such that yj^Xj for all j and 
T>(y) differs from T>(x) only in the f-th position. If no such 
yE Aq exists, then £{i,x) = E indicating that block erasure 
is required. To bootstrap the encoding process, we assume 
that the initial state of the n memory cells is (0,0,... ,0). 
Henceforth, iteratively applying the encoding map, we can 
determine how any sequence of transitions — » 1 or 1 — » 
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(y\,yz,...,yn) = (xx,x 2 ,...,x n ) ; 

i x = f ind_left_cell (y 1; y 2 , ■ ■ ■ , y n ) ; 

i 2 = f ind_right_cell {y\, y 2 , ■ ■■ ,y n ) ) 

if (z 2 == 0) return E; 

if == i 2 ) II there is only one non-full cell 

{if (;==2) a = 2; 

else a = j + 2- (y (1 (mod 2) ) ; 

if (y (1 + a > - 1) return E; 

else { y h = y h + a; return; } } 

if ((i 2 -xi ==D A (y f/ = 9 -l)) 
{o f/ =0; o,- 3 . = y h _. (mod 2); 

a = 2 ■ 57 2 + v x - {y h _ . (mod 4) ) ; 

if(a<0) y i3 . = y i3 . + A + x; 

else y» 3 , = y; 3 , + «; } 



in the information bits maps into a sequence of cell-state 
vectors, eventually terminated by the block erasure. This leads 
to the following definition. 

Definition. An (n, k)q flash code <C(T> ,£) guarantees t writes 

if for all sequences of up to t transitions — » 1 or 1 — » in the 
k information bits, the encoding map £ does not produce the 
block erasure symbol E. If so, we say that C is an (n,k,t)q 
code, and define the deficiency of C as 6(C) = n(q—l) — t. 

In addition to this definition, we will also use the following 
terminology. Given a vector x = (x\,x 2 , . . . ,x m ) over Aq, we 

define its weight as wt(x) = X\ + X\ ^ + x m (where the 

addition is over the integers), and its parity as wt(x) mod 2. 

III. Two-Bit Flash Codes 

In this section, we present a construction of flash codes that 
uses n q-ary cells to store k = 2 bits. In (9), a construction 
with these parameters was presented and was shown to be 
optimal. The construction we present in this section will be 
proved to be optimal as well and we believe that it is more 
intuitive. 

In this construction, the leftmost and rightmost cells corre- 
spond to the first and second bit, respectively. When rewriting, 
assume the first bit changes its value, then the leftmost cell 
of level less than ^ — 1 is increased by one level. Similarly, 
whenever the second bit changes its value, the rightmost cell 
of level less than ^ — 1 is increased by one level. In general, 
the cell-state vector has the following form: 

(q-l,...,q-l,Xi,0,...,0,Xj,q-l,...,q-i), 

where < x,,Xj ^ q — 1. This principle repeats itself 
until only one cell is left with level less than q — 1. Then, 
this cell is used to store two bits according to its residue 
modulo 4. If this residue is 0, 1, 2, 3 then the value of the 
bits is {vi,V 2 ) = (0,0), (1,0), (0,1), (1,1), respectively. 
The construction is presented for odd values of q and we 
will discuss later how to modify it for even values as well. 
In the remainder of the paper, these maps are described 
algorithmically, using (C-like) pseudo-code notation. 
Decoding map T> 2 $ : The input to this map is a cell-state 
vector x = (x\,x 2 , . . . ,x n ). The output is the corresponding 
two-bit information vector (v\,v 2 ). 



Encoding map £ 2 b- The input to this map is a cell-state 
vector x = (x\,x 2 , . . . ,x n ), and an index j E {1,2} of the 
bit that has changed. Its output is either a new cell-state vector 
y = (yi' Vli ■ ■ ■ > yn) or me erasure symbol E. 



The function f ind_left_cell (t/i, y 2 , . . . ,y n ) finds 
the leftmost cell of level less than q — 1 and if there is 
not such a cell then it returns n + 1. Similarly, the func- 
tion f ind_right_cell {y\, y 2 , . . . , y n ) finds the right- 
most cell of level less than q — 1 and if there is not such a 
cell then it returns 0. The notation y\. stands for the variable 
yij in case 7 = 1, and y t - if j = 2. The same rule applies 
to y; 3 _.. The symbol A stands for the logical operator "and". 
The next theorem proves the number of writes this construction 
guarantees. 

Theorem 1. If there are n q -level cells and q is odd, then the 
code C(£>2B/ £ib) guarantees at least t = (n — l)(q — 1) + 
writes before erasing. 

Proof: As long as there is more than one cell of level less 
than q — 1, the weight of the cell-state vector increases by one 
on each write. This may change only after at least (n — 1) (q — 
1) writes. Assume that there is only one cell of level less than 
q — 1 after s = (n — l)(q — 1) + fc writes, where k ^ 0, and 
call it the z'-th cell. Starting this write, the different residues 
modulo 4 of the z'-th cell correspond to the four possible two- 
bit information vector (vi,v 2 ). Therefore, on the s-th write, 
we also need to increase the level of the z'-th cell so it will 
correspond to the correct information vector on this write. For 
all succeeding writes, if the second bit changes then the z'-th 
cell increases by two levels. If the first bit changes from to 
1 then the z'-th cell increases by one level and otherwise by 
three levels. Therefore, if there are m more writes and V\ — 
then the z'-th cell increases by at most 2zzz levels, and if there 
are m more writes and V\ = 1 then the z'-th cell increases by 
at most 2zzz + 1 levels. 

Let us consider all possible values of k and the information 
vector [v\,v 2 ) on the s-th write in order to calculate the 
number of guaranteed writes before erasing. Note that on the 
s-th write (v\ + v 2 ) = s(mod 2). Furthermore, since q is 
odd, the value of the bit that is written changes from one to 
zero because it reaches level q — 1, and thus the other bit has 
value A: (mod 2). 



11 = f ind_left_cell (y lr y 2 ,...,y n ); 

1 2 = f ind_right_cell (z/i, y 2 , . . .,y n ) ; 
i f ( i 2 == ) II all cells are full 

{vi = q-l (mod 2) ; v 2 = [ ( {q - D (mod 4) ) /2j ; } 
if (z'i ==i 2 ) II there is only one non-full cell 
{vi = y h (mod 2) ; v 2 = [ (y h (mod 4) ) /2j ; } 
if (z'i \=i 2 ) II there are at least two non-full cells 
{v x = y h (mod 2) ; v 2 = y, 2 (mod 2) ; } 
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1) Assume fc(mod 4) = 0, then {v\,v 2 ) = (0,0) and the 
level of the z'-th cell does not increase on the s-th write. 
Since V\ = 0, after m writes the cell increases by at 



most 2m levels. Hence, there are at least 



q-l-k 



writes and the total number of writes is at least 



\){q-\)+k + 



1 



> („-l)( ? -l) + 



2 y 2 

2) Assume fc(mod 4) = 1, then (vi,V 2 ) = (1,0) or 
(vi,v 2 ) = (0,1). If (vi,v 2 ) = (1,0) then on the s- 
th write the z'-th cell does not increase its level and after 
m writes its level increases by at most 2m + 1 levels. If 
(v\, v 2 ) = (0, 1) then the z'-th cell increases by one level 
and after m writes its level increases by at most 2m more 

a— 2— k 

levels. Hence, in both cases there are at least — ~ — 
more writes. Together we get that the total number of 
writes is at least ~ , ., 



l)( fl -l)+fc + 



> ( n _!)( ? _l) + 



3) Assume /c(mod 4) = 2, then (pi,V 2 ) = (0,0) and 
the z'-th cell increases by two levels on s-th write. 
Since V\ = 0, after m more writes the cell increases 
by at most 2m levels and hence there are at least 
l(q — 1 — (k + 2))/2J more writes, where k ^ 2. 
Therefore, the total number of write is at least 



1)(T 



+ q I - >(n-l)(y- 



D + 



2 v 2 

4) Assume /c(mod 4) = 3, then {v\,v 2 ) = (1,0) or 
(p l7 t; 2 ) = (0,1). If (vi,v 2 ) = (1,0) then on the s- 
th write the z'-th cell increases by two levels and after 
m more writes it increases by at most 2m + 1 levels. If 
iPl/V?) = (0,1) then the z'-th cell increases by three 
levels and after m more writes it increases by at most 
2m more levels. Hence there are at least ? i - more 
writes, where k ^ 3. Thus, the total number of writes is 
at least 



(n-l)(<j-l)+fc + 



l*>(„-l)( 9 -l) + £-i. 



In any case, the guaranteed number of writes is (n — l)(q — 

1) " 

For even values of q, the construction is very similar. As 

long as there is more than one cell of level less q — 1 we follow 
the same rules for the encoding. For the decoding, since q — 1 
is no longer even, the value of V\ is the parity of the cells 
1, . . . , z'i, where i\ is the leftmost cell of value less q — 1. The 
value of v 2 is the parity of the cells i 2 , i 2 + 1, . . . , n, where 
i 2 is the rightmost cell of value less q — 1. If there is only 
one cell left, then it represents a value of two bits as before 
according to its residue modulo 4. If the the index of the last 
available cell is i then 

v 1 = (z-l + y,)(mod2), 

v 2 = ((n - i) + L(y,(mod4))/2j)(mod2). 

Also, the last cell does not reach level q — 1 so it is always 
possible to distinguish what the last cell is. We omit the tedious 
details as the proof is similar to the case where q is odd. 



IV. Index-less Indexed Flash Codes 

Our point of departure is the family of so-called indexed 
flash codes, due to Jiang and Brack 1111 . that were briefly 
described in SectionG] In this section, we eliminate the need 
for index cells — and, thus, the overhead associated with these 
cells — in the Jiang-Bruck construction ifTTl . This is achieved 
by "encoding" the indices into the order in which the cell 
levels are increased. 

As in 1111 . we partition the n memory cells into m groups of 
n' cells each. However, while in IfTTl the value of n' is more 
or less arbitrary, in our construction n' = k. We henceforth 
refer to such groups of n' — k cells as blocks (though they 
are not related to the physical blocks of floating-gate cells 
which comprise the flash memory). We will furthermore use, 
throughout this paper, the following terminology. We say that: 

► a block is full if all its cells are at level q—1; 

► a block is empty if all its cells are at level zero; 

► a block is active if it is neither full nor empty; 

► a block is live if it is not full (either active or empty). 
In our construction, each block represents exactly one bit. This 
implies that the total number of blocks, given by m = [n jk\ , 
must be at least k, which in turn implies n ^ k 2 . If n is 
not divisible by k, the remaining cells are simply left unused. 
Finally, we also assume that either k is even or q is odd. If 
this is not the case, we can invoke the same construction with 
k replaced by k + 1 (and the last bit permanently set to zero). 

The key idea is that each block is used to encode not only 
the current value of the bit that it represents, but also which of 
the k bits it represents. The value of the bit is simply the parity 
of the block. The index of the bit is encoded in the order in 
which the levels of the k cells are increased. For example, if 
the block stores the z'-th bit, first the level of the z'-th cell in the 
block is increased from to q— 1 in response to the transitions 
— > 1 and 1 —7- in the bit value. Then, the same procedure is 
applied to the (z'+l)-st cell, the (z'+2)-nd cell, and so on, with 
the indices z + 1, z + 2, . . . interpreted cyclically (modulo k). 
This process is illustrated in the following example. 

Example 1. Suppose that k = 4 and q — 3. If a block 
represents the first bit, then its cell levels will transition from 
(0,0,0,0) to (2,2,2,2) in the following order: 

(0000) -> (1000) (2000) -> (2100) -> (2200) 

(2210) (2220) -> (2221) (2222) 

On the other hand, for a block that represents the second bit, 
the corresponding cell-writing order is given by: 

(0000) (0100) (0200) -> (0210) -> (0220) 

-» (0221) -> (0222) (1222) (2222) 

The cell-writing orders for blocks that represent the third and 
fourth bits are given, respectively, by 

(0000) -> (0010) -> (0020) -»■ (0021) -> (0022) 
-> (1022) -» (2022) -> (2122) -> (2222) 

and 

(0000) -> (0001) (0002) -> (1002) -> (2002) 
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-> (2102) -> (2202) -> (2212) -> (2222) 

Note that, unless a block is full, it is always possible to 
determine which cell was written first and, consequently, 
which of the k = 4 bits this block represents. 

We now provide a precise specification of an (n,k)q flash 
code C based upon this idea, in terms of a decoding map T>q 
and an encoding map £q. 

Decoding map T>q : The input to this map is a cell-state vector 
x = (xi\x2\ ■ ■ ■ \x m ), partitioned into m blocks. The output is 
the corresponding information vector (vq, V\, . . . , Cfc-l)- 

(v Q ,v x ,... ,v k _i) = (0,0,..., 0); 

for (; = 1; / ^ m; j = j + 
if (active (at;) ) 

{ i = read_index (xA ; Vj = parity (xi) ; } 



Encoding map £q : The input to this map is a cell-state vector 
x = (xi\x2\ ■ ■ ■ \x m ), partitioned into m blocks of k cells, and 
an index i of the bit that has changed. Its output is either a 
cell-state vector y = (y-j |y,| ' ' ' |y m ) or the erasure symbol 
E. 



To complete the specification of the flash code C(T>q,£q), 
let us elaborate upon all the functions used in the pseudo-code 
above. The function active {x) , respectively empty {x) , 
simply determines whether the given block is active, respectiv- 
ely empty. The function parity {x) computes the parity of 
x, defined in Section|II] Note that the parity of a full block 
is always zero (since k(q— 1) is even, by assumption). The 
function read_index (x) computes the bit-index encoded 
in an active block x = (xq, x\, . . . , Xjt-l)- This can be 
done as follows. Find all the zero cells in x. Note that 
these cells always form one cyclically contiguous run, say 
X;, Xj + i, . . . , Xj +r (where the indices are modulo k). Then the 
index of the corresponding bit is i — j + r + 1 (modfc). If 
there are no zeros in x, there must be exactly one cell, say Xj, 
whose level is strictly less than cj—1. In this case, the bit-index 
is i = 7 + 1 (modi:). The function write (y) proceeds 
along similar lines. Find the single cyclically contiguous 
run of zeros in (y , y lr . . . , y fc _i), say y } , y j+l , y j+r . If 
yj-l<cj— 1, increase yy_j by one; otherwise set yi — 1. 
If there are no zeros in y, find the unique cell y j such that 
y; < q—1 and increase its level by one. Finally, the function 
write_new(z, y) simply sets y, = 1. 

Theorem 2. The write deficiency of the flash code C(Vq,£q) 



described above is at most 

+ - l) = 0(qk 2 ) (3) 

Proof. Note that at each instance, at most k of the m blocks 
are active. The encoding map £q(1,x) produces the symbol 
E when there are no more empty blocks, and none of the 
active blocks represents the z-th bit. In the worst case, this 
may occur when there are k — 1 active blocks, each using just 
one cell level. This contributes (k — l)(k(q—l) — l) unused 
cell levels. In addition, there are at most k — 1 cells that are 
unused due to the partition into m = \ n jk\ blocks of exactly 
k cells. These contribute at most (k — l)(q—l) unused cell 
levels. | 

V. Nearly Optimal Construction 

It is apparent from the proof of Theorem|2] that the defi- 
ciency of the flash code C(T>q, £q), constructed in SectionllVl 
is due primarily to the following: when writing stops, there 
may remain potentially large amount of unused cell levels. The 
key idea developed in this section is to continue writing after 
the encoding map £ g produces the erasure symbol E, utilizing 
those cell levels that are left unused by £q. Obviously, it is 
not possible to continue writing using the same encoding and 
decoding maps. However, it may be possible to do so if, at 
the point when £q produces the erasure symbol E, we switch 
to a different encoding procedure, say £\. In fact, this idea 
can be applied iteratively: once £\ reaches its limit, we will 
transition to another encoding map £2, then yet another map 
£3, and so on. 

Assuming that k = (mod 4), here is one way to continue 
writing after the encoding map £ q has been exhausted. When 
£ produces the erasure symbol E, we say that the^rsf stage of 
encoding is over and transition to the second stage, as follows. 
First, we re-examine the cell-state vector x = (xi|x2| ' ' ' \x m ) 
and re -partition it into 2m = 2 [n/k\ blocks ofk/2 cells each. 
Most of these smaller blocks will already be full, but we may 
find some ni\ of them that are either empty or active (live). 
Observe that ni\ ^ 2 (k — 1) since at the end of the first stage, 
there are at most k — 1 active blocks of k cells, and each of 
them produces at most two live (non-full) blocks ofk/2 cells. 

If mi ^ k, we can continue writing as follows. Once again, 
each of the m\ blocks will represent exactly one bit; as before, 
the value of this bit is determined by the parity of the block. As 
part of the transition from the first stage to the second stage, 
we record the current information vector (vq, V\, . . . , 1^-1) m 
the first k of the m\ live blocks, say x\,X2,- ■ .,x^. To this 
end, whenever parity (Xj) ^ Dj— 1, we increase the level of 
one of the cells in X{ by one; otherwise, we leave x, as is. 

Since the blocks now have k/2 cells rather than k cells, 
it is no longer possible to encode in each block which of 
the k information bits it represents. Therefore, we set aside 
for this purpose 2(k— 1) [log^ (fc+2)] index cells (that are 
not used during the first stage). These cells are partitioned 
into 2(k— 1) blocks of \x = [log^ftM^)] cells each, which 
we call index blocks. Henceforth, it will be convenient to 
refer to the blocks of k/2 cells as parity blocks, in order to 
distinguish them from the index blocks. Initially, the first k 



Cl/l 1 3/2 1 ■ ■ ■ \Vm) = (*ll*2l ' ' ' \Xm)} 

for (;' = 1; ; ^ m; / = j + 

if (active (xj) A (read_index (Xj) == i) ) 

{ write (y ; ) ; break; } 

if {j == m + 1) // active block not found 

for (;' = 1; j ^ m; j = j + 

if (empty (Xj)) { write_new (/, y ■) ; break;} 

if (j==m + l) II no empty blocks remain 
return E; 
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index blocks u\, u^, ■ ■ ■ , M/t are set so that w, = i (in the base- 
q number system), which reflects the fact that the information 
bits Vq,v i, . . . , are stored (in that order) in the first k 
live parity blocks. The next m\—k index blocks are set to 
(0,0,. . .,0), thereby indicating that the corresponding (live) 
parity blocks are available to store information bits. The last 
2{k— 1) — m\ index blocks are set to (q—l,q—l,...,q—l) 
to indicate that the corresponding parity blocks are full (in 
fact, nonexistent). Finally, it is possible that in the process 
of enforcing parity (Xj) = for the first k live parity 
blocks, some of these blocks become full (this happens iff 
wt(*,) = (k/2)(q-l) - 1 and v t = at the end of 
the first stage, since k/2 is even by assumption). To ac- 
count for this fact, we set the corresponding index blocks to 
{cj— 1, q—1, . . . , q— 1). This completes the transition from the 
first stage to the second stage, which is invoked when the 
encoding map £q produces the erasure symbol E. 

Let us now summarize the foregoing discussion by giving 
a concise algorithmic description of the transition procedure. 

Transition procedure T\ : Partition the memory into 2 [n jk\ 
parity blocks of k/2 cells, and identify the m\ ^ 2(k— 1) 
parity blocks x\, Xi, . • . / x mi that are not full. If ni\ < k, 
output the erasure symbol E and terminate. Otherwise, set the 
2{k— 1) index blocks u\, u^, ■ ■ ■ , W2/c-2 as follows: 

(i for i = 1,2, ...,k 

Ui = I for i = k + l,k + 2,...,mi (4) 

[q^-1 for i = m 1 +l,m 1 +2, ...,2k -2 

where jx= |"log.(fc+2)] is the number of cells in each index 
block, then record the information vector (vq, v\, . . . , E>jt_i) 
in the first k live parity blocks x\,x^_, . . . , xj., as follows: 

for (i = l; z < k; i = i + l) 
if (parity (x,) ^ 

{ increment (%;) ; if (full (x,)) w, = q^ — 1;} 



The function full (x) determines whether the given block x 
(which could be a parity block or an index block) is full. The 
function increment (x) increases by one the level of a cell 
(does not matter which) in the given live block. 

During second-stage encoding and decoding, we will need 
to figure out for each active parity block x which of the k 
information bits it represents. To this end, we will have to find 
and read the index block u that corresponds to x. How exactly 
is the correspondence between parity blocks and index blocks 
established? Note that, upon the completion of the transition 
procedure 71, there is the same number of live parity blocks 
and live index blocks; moreover, the j-th live index block 
corresponds to the j-th live parity block, for all j. The encoding 
procedure will make sure that this correspondence is preserved 
throughout the second stage: whenever a parity block becomes 
full, it will make the corresponding index block full as well. 

We are now ready to present the encoding and decoding maps 
which are, again, specified in C-like pseudo-code notation. 

Decoding map T>\ : The input to this map is a cell-state vector 
x = {x\\x2\ ■ ■ ■ \x2m\ u l\ u l\ ' ' ' \ u 2k-l)> partitioned into 2m 



parity blocks, of k/2 cells each, and 2(k— 1) index blocks. 
The output is the information vector (vq, v\, . . . , Ujt-l)- 



(v ,vi,...,v k _i) = (0,0,. . .,0); 
for (£ = j = l; j^2m; j = j + l) 

if (full(Xy)) continue; // skip full blocks 
while (full(i^)) l = l + \; // skip full blocks 
i = U£-, 1 = 1 + 1; 
if (i ^ 0) Vf—i = parity (xi) ; 



Given an index i of the bit that has changed, the encoding map 
£ i first tries to find an active parity block x that represents the 
z'-th information bit. If such a block is found, it is incremented 
and checked to see if it is full (in which case the corresponding 
index block is set to q^ — 1). If not, another live parity block 
is allocated to represent the z'-th information bit. If no more 
live parity blocks are available, 

the erasure symbol E is returned. 
Encoding map £ j : The input to this map is a cell-state 
vector x = {x\\X2 \ • • • |*2m|| «ll M 2| ' ' ' \ u 2k-2)> partitioned 
into 2zzz parity blocks and 2(k— 1) index blocks, and an index 
i of the information bit that changed. Its output is either a 

cell-state vector y = (y^l 4 4 4 IVzmW u l 141 - - ' \ u 2k-2) or 
the symbol E. 



Note that when the second encoding stage terminates, there 
are at most k — 1 parity blocks that are not full, comprising at 
most k(k- l)/2 cells (at most k(k - l)(tj-l)/2 cell-levels). 



(3/1 1 3/2 1 ' ' ' \V2rn) = (Xl\x 2 \ ■ ■ ■ \x 2m ); 
( M 'll M 2l ' ' ' \ u 2k-2) = ( M ll M 2| ■ ■ ■ W2I-2); 

for {l = j = l; ]^2m; j = j + l) 

if (full(Xj)) continue; 
while (full(Mf)) 1 = 1 + 1; 
if (u e ==i + l) 

increment (yj) ; 

if (full (y ; )) u' e = q^-l; 

break; 

} 

else 1 = 1 + 1; 

} 

if ( j == 2m + 1) II active block not found 
for (£ = j=l; 2m; j = j + l) 

if (full(Xj)) continue; 
while (full(z^>)) 1 = 1 + 1; 
if (u e == 0) 

u' f = i + 1 ; 

if (parity (xj) 7^ Vj) increment (yj) ; 

if (full(j/ ; .)) u\ = q^-l; 

break; 

} 

else 1 = 1 + 1; 

} 

if (/== 2zn + 1 ) II no more available live blocks 
return E; 
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Once the maps T>\ and S\ are understood, it becomes clear 
that the same approach can be applied iteratively. The resulting 
flash code C* will proceed, sequentially, through s differ- 
ent encoding stages £q, £\, . . . , £ s -\, where s = [log 2 k] . In 
describing this code, we shall assume for the sake of simplicity 
that A: is a power of two, that is k = 2 s . If not, the same code 
can be used to store 2 s > k information bits, of which the last 
2 s — k are set to zero. Note that this will not change the order 
of the resulting write deficiency. 

To accommodate the encoding maps £ \, £2, ■ ■ ■ , £ s -i, we set 
aside for each map a batch of 2(k — 1) index blocks, with each 
index block consisting of ji = [log (? (/c+2)] cells. The transi- 
tion procedure % which bridges between the encoding maps 
£ r -\ and £ r (for some r G {2, 3, . . . , s — 1}) is identical to the 
transition procedure 7i, except for the following differences: 

Dl. The r-th batch of index blocks is used; and 
D2. The parity blocks consist of k/2 r cells each. 
In addition to Dl and D2, the decoding/encoding maps T> r and 
£ r differ from T>\ and £\ in that "2m" should be replaced by 
"2' m" throughout, where m stands for [n/k\ as before. There 
are no other differences. 

Theorem 3. For s = [log 2 fc] , the write deficiency 
of the Hash code C* defined by the sequence of 
decoding/encoding maps Dq,T>\, . . . ,T> S _\ and 
£o,£\,.-.,£ s -i is 0(qk\og 2 k/logq). 

Proof. We consider the worst-case scenario for the number 
of cell levels that are either unused or "wasted" in the overall 
encoding procedure. As before, there are at most k — 1 cells 
that are unused due to the partition into [n/k\ blocks, of ex- 
actly k cells each, at the very first encoding stage. These cells 
contribute at most (q— 1) (k — 1) unused cell levels. The index 
blocks for the s — 1 encoding maps £\,£x, ■ ■ ■ ,£s-l contain 
2(fc — l)(s — l)fx cells altogether, thereby wasting at most 



2( 9 -l)(k-l)( s -l)|W (*+2)l = O 



qk log k 
logq 



(5) 



cell levels. In each of the s — 1 transition procedures, the situ- 
ation parity (%,-) 7^ can occur at most k times, and each 
time it occurs a single cell level is wasted. Finally, as in Theo- 
rem|2j when the encoding process £ , £\, . . . , £ s _i terminates 
there are at most k — 1 parity blocks that are not full and, in 
the worst case, each of them uses just one cell level. However, 
now these parity blocks contain only \k/2 s ~ ll \ = 2 cells each, 
and thus contribute at most (k — 1) (2q — 3) unused cell levels. 
Putting all of this together, we find that at most 



( 9 -l)(*-l)(2(s-l)rio g(f (*+2)l +3J + k(s-l) (6) 

cell levels are wasted or left unused. Clearly, this expression is 
dominated by ©, and thus bounded by O [qk log 2 A:/log q) . | 

For large q, the upper bound of 0(qk\og 2 k/logq) on 
the deficiency of our scheme can be improved by using 
a more efficient "packaging" of index blocks in the flash 
memory. As before, we allocate a batch of 2{k — 1) index 
blocks to each encoding stage except £$. But now, every 
index block will occupy fs' = [log 2 (fc+2)] cells rather than 
jU = [log (; (/c+2)] cells, and the indices will be written in 



binary rather than in the base-q number system. This allows 
index blocks that correspond to successive encoding stages to 
be "stacked on top of each other" in the same memory cells. 
Specifically, the encoding stage £\ will use only cell levels 
and 1 to record the indices in its index blocks. Once this stage 
is over, the index information recorded during 71 and £\ is 
no longer relevant, and the level of all the 2(k — l)^i' cells 
in the 2(k — 1) index blocks can be raised to 1. Thereafter, 
provided q ^ 3, the transition procedure T2 and the encoding 
map £2 can use cell levels 1 and 2 to record the relevant 
index information in the same memory cells. Proceeding in 
this manner, we can accommodate up to q — 1 batches of 
index blocks in 2(k — l)fx' memory cells. We shall refer to 
this indexing scheme as stacked binary indexing and denote 
the resulting flash code by C. 

Theorem 4. The write deficiency of the flash code C' defined 
by the sequence of decoding/encoding maps T>q, T>\, ... , X> s _i 
and £ , £\, . . . , £ s -\ that use stacked binary indexing is at most 
O(qklogk) ifq ^ log 2 fc, and at most 0(k log 2 k) otherwise. 

Proof. With stacked binary indexing, the number of cell le- 
vels wasted in all the 2(k — 1) (s — 1) index blocks is at most 



2{q-l){k-l) 



riog 2 (/c+2)] 



(7) 



Although for most values of k and q this is strictly less than ©, 
all the other terms in © are still dominated by Q. | 

Remark. If we need to store k symbols, rather than bits, 
over an alphabet of size i > 2, the same flash code can 
still be used, with an appropriate interface. With the linear 
WOM-code of ifTTl . the £-ary symbols can be represented 
using (. — 1 bits in such a way that any symbol change 
corresponds to a single bit transition. The flash code C' can 
be now applied as is, and the resulting write deficiency is 
0(max{q, log 2 M} U log k£) . 

VI. Flash Codes of Constant Rate 

All of our results so far pertain to the case where n ^ k 2 . In 
this section, we briefly examine the situation where both k and 
n are large, while k/n = R for some constant R < 1. Observe 
that write deficiency 5(C) = n(q—l) — t is not an appropriate 
figure of merit in this situation: a trivial code that guarantees 
t = writes achieves write deficiency n(q— 1) = k(q—l)/R, 
which is within a constant factor 2/R from the lower bound (T|). 
Thus we will state our results in terms of the guaranteed num- 
ber of writes t rather than the write deficiency 5(C). 

If q = 2, we can easily guarantee O(n/log/c) writes as fol- 
lows: partition the n cells into blocks of size [log 2 A:] and each 
time an information bit changes, record its index in the next 
available block. For q > 2, the same method guarantees about 
[«/log^/cJ = Cl(n log ^/ log /c) writes, but we can do better. 

Let us partition the n cells into two groups: the index group 
consisting of n — k cells and the parity group consisting of k 
cells. The index group is then subdivided into m = [(n—k) /sj 
blocks, each consisting of s = [log 2 fc] cells. The writing 
proceeds in q — 1 phases. During the first phase, every time an 
information bit changes, its index is recorded in binary (using 



cell levels and 1) in the next available index block. After m 
writes, the first phase is over. We then copy the k information 
bits into the k cells of the parity group, and raise the level 
of all cells in the index group to 1. The second phase can 
now proceed using cell levels 1 and 2, and recording changes 
in information bits relative to the values stored in the parity 
group. At the end of the second phase, the current values of 
the k bits are recorded in the parity cells using levels 1 and 
2, and so on. This simple coding scheme achieves 

n(q-l)(l-R) 



m(q — 1) 



nq 



(8) 



log 2 fc V!ogk 

writes (where the middle expression ignores ceilings/floors by 
assuming that A: is a power of two and that n — k is divisible 
by log 2 k). If q is odd and R ^ 0.415, we can do a little better 
by using the ternary number system (cell levels 0, 1, 2) in both 
the index group and the parity group. In this case, the size of 
the parity group is [fc/log 2 3] cells and 1 — R in ^ can be re- 
placed by (log 2 3 - R) /2. Finally, for all R ^ 0.755 and q - 1 
divisible by three, the quaternary alphabet is optimal, leading 
to a factor of (2 - R) /3 rather than 1 - R in I®. 

VII. Buffer Codes 

Buffer codes were first presented by Bohossian et al. in |fl~). 
In this family of codes, a buffer of r symbols has to be stored 
in n flash memory c/-ary cells. After each write, the last r 
symbols that were written have to be recovered by the cell- 
state vector. The goal is to maximize f, the number of write 
symbols that the code guarantees without incurring a block 
erasure. In 0~), ifTOl . an upper bound and a construction are 
presented for the case where the buffer is stored in a single cell. 
It is also shown how to store a buffer where, n, the number 
of cells satisfies n ^ 2r. 

A. Buffer Codes Definition 

We refer to the set of vectors in {0, ...,£ — 1}' as buffer 
vectors. Similarly to a flash code, a buffer code C is also 
specified by an encoding map £ and a decoding map T>. The 
decoding map T> : Aq — > {0, ...,£ — 1}' assigns for each 
cell-state vector x G Aq its buffer vector T> (x). The encoding 
map £: {0, ...,£— 1}xA„ — > Al 1 U {E} specifies for every 
symbol a 6 {0, ...,£ — 1} and cell-state vector x G Aq, 
another cell-state vector y = £(a,x) such that yj^Xj for 
all 1 ^ ;' < n, {V(y)) x = a and for 2 ^ i ^ r, 
(T>(y))i = (T>(x))i-\. In case such a y G A? does not exist, 
then £{i,x) = E. 

Definition. An (n,r,£,t)q buffer code C(T>,£) guarantees t 
writes if for all sequences of up to t symbol writes, the encoding 
map £ does not produce the block erasure symbol E. 

B. Single-Cell Buffer Codes 

In this section, we discuss the case where there is a single 
cell {n = 1) to store the buffer. A construction for this scenario 
where a binary buffer {£ = 2) is stored was given in |fl~), 
iflOl . This construction guarantees at least t = j^rrj + r — 2 



writes before a block erasure. An upper bound was given as 
well, which asserts that for every buffer code with one cell, 
the number of writes t has to satisfy 



'< [-£rri\ -r+L((?-l)mod (f-l) + l)\. 
Let us show here another upper bound for such codes. 
Theorem 5. For any (1, r, £, i)n buffer code C such thatq ^ 



q-£ r 



where <p is Euler's <p function. 



Proof: Let C(V,£) be a (l,r,£,t) q buffer code. After 
i ^ 1 writes, for each v 6 {0,1, . . . , £ — l} r , let 

Si(v) = {x | there is a sequence of j ^ i symbol 

writes ending in level x and T>(x) = v}, 

nii{v) = max X £s.( v \{x} is the maximum cell level that is 
possible to reach after i symbol writes such that T)(mj(v)) = 
v, and 

M,= £ |S f (D)|. 
ve{o,..J-iy 

Clearly, for all i ^ t, M; ^ q — 1. After r writes, it is possible 
to reach any of the £ r different buffer vectors and thus M r 
P - 1. 

Let Q( r be the r-th order £-ary de Bruijn graph Q. Its 
vertex set is V( r = {0,1, . . . , £ — l} r and its edge set is 
£l r . Let v\,V2 G {0, 1, ...,£ — 1}'' be two different buffer 
states. Note that if (v\,V2) G £g r and m,-(i?i) > wt;^) 
then ?W;_|_i(t>2) > Miivz) an d therefore, the value of M (+ i 
increases by at least one level for every such an edge. In the 
de Bruijn graph, every cycle has at least one edge (1^1,02) G 
£( r such that Wf(fi) > m^v-i). Therefore, the number of 
new unused levels is at least the number of disjoint vertex 
cycles in Q^ r . This number is known to be j Y*d\r IP (3)^* Q' 
lfl6l . and therefore 



Lemma 6. The bound in Theorem\5\improves the bound in 01 
forq > P. That is, 

q-£ r 



J-IJ ■ r + [log, (((<? - 1) mod (f - 1)) + 1)J . 
Proof: Note that 

7$>(5) 

d\r 
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and therefore 

q-e 



£ r + £<p(r) 



+ r 



q-£ r 

r 

q + £cp(r) 



If we denote q — 1 = x{£ r — 1) + y, where ^ y ^ £ r — 1, 
then 



<7 



q + £<p(r) 
£ r + £cp{r) 



x{£ r - 1) + y + 1 + £q,{r) 




l r + £cp(r) Y _ 




x(£ r + £q,(r))-x + y + l-{ 


x — l)£(p{r) 



= XT + 

^ xr + 



+ £<p(r) 
-x + y + 1 - {x- l)£cp(r) 



£ r + £cp(r) 



(y + l)r 



Let us show that ^ log ( ,(y + 1). That is, we show that 

(y+1) > or 



The function /(x) = x-< is monotonically decreasing for 
x ^ 1 and since y ^ £ r — 1, we get 



{y + lWj > 

Putting these together we get 

q-£ r 



^xr+ Llog f (y + l)J 



+ r ^ xr + 


(y + l)r 







q-l 



r+ [log, {((q-1) mod -l)) + l)J. 



C. Multiple-Cells Buffer Codes 

In flTJ, iflOl . a buffer code construction is given for £ = 2 and 
arbitrary n,q,r, where n 2z\ This construction guarantees 
t = (q — l)(n — 2r + 1) + r — 1 writes. In this section, 
we show how to improve this construction such that the 
guaranteed number of writes is t — (q — l)(n — r). 

In the case where q = 2, the construction in (TJ, IflOl 
guarantees n — r writes. The encoding procedure is performed 
in such a way that after i writes, 1 ^ i ^ n — r, the buffer 
is located between the (z + l)-st and (z + r)-th cells, where 
the first bit of the buffer memory is stored in the (z ' + r)-th 
cell and the last bit is stored in the (z + l)-st cell. If q > 2, 
then the construction uses a "layer by layer" approach. That 
is, first the layer of levels and 1 is used, then the layer of 
levels 1 and 2 is used, and so on. In the transition from the 



layer of levels i — 1 and i to the layer of levels i and z + 1, 
all the cells are first reset to level i and the buffer is written 
in the new layer of levels i and i + 1. Then, it is possible to 
continue writing in this layer. Basically, on each layer, it is 
possible to write n — r times. However, when a new layer is 
used, then first the buffer from the previous layer is copied 
and then it is written in the new layer. Hence, it is possible 
to have only (zz — 2r + 1) more writes in the new layer and 
thus the total number of writes is 

n - r + (q - 2)(n - 2r + 1) = (q - l)(n - 2r + 1) + r - 1. 

The transition between these consecutive layers is not 
performed efficiently and our improvement here shows how it 
is possible to write n — r times on each layer such that the total 
number of writes is t — (q — l)(n — r). We first demonstrate 
how the construction works by the following example. 

Example 2. In this example, we show how the last construc- 
tion works for n = 11, q — 3,£ = 2 and r = 4, so the 
number of writes is 2 ■ (11 — 4) = 14. The sequence of bits 
to be written is 1, 1, 0, 0, 1, 0, 0, 1, 1, 1, 0, 1, 1, and the writes 
are performed as follows. The underlined cells represent the 
cells that store the buffer on each write. 



Written Bit 


Buffer State 


Cell State Vector 




(0,0,0,0) 


(0,0,0,0,0,0,0,0,0,0,0) 


1 


(0,0,0,1) 


(0,0,0,0, 1,0,0,0,0,0,0) 


1 


(0,0,1,1) 


(0,0,0,0,1,1,0,0,0,0,0) 





(0,1,1,0) 


(1,0,0,0,1,1,0,0,0,0,0) 





(1,1,0,0) 


(1,1,0,0,1,1,0,0,0,0,0) 


1 


(1,0,0,1) 


(1,1,0,0,1,1,0, 0, 1,0,0) 





(0,0,1,0) 


(1,1,1,0,1,1,0, 0,1,0,0) 





(0,1,0,0) 


(1,1,1,1,1,1,0, 0,1,0,0) 


1 


(1,0,0,1) 


(1,1,1,1,2,1,1,1,1,0,0) 


1 


(0,0,1,1) 


(1,1,1,1,2,2,1,1,1,0,0) 


1 


(0,1,1,1) 


(1,1,1,1,2,2,2,1,1,1,0) 





(1,1,1,0) 


(2,1,1,1,2,2,2, 1,1,1,1) 


1 


(1,1,0,1) 


(2,1,1,1,2,2,2,1,2,1,1) 


1 


(1,0,1,1) 


(2,1,1,1,2,2,2,1,2,2,1) 





(0,1,1,0) 


(2,1,1,1,2,2,2,1,2,2,1) 



Now we are ready to present the construction by specifying 
its encoding and decoding maps specification. 
Decoding map T>^: The input to this map is a cell-state 
vector x = [x\,%i, . . ■ ,x n ). The output is the corresponding 
information buffer vector (v\,vi, ■ • ■ , v r ). 



m = max (x\, Xi, ■ ■ ■ , x n ) ; 
zz m = f ind_repeat (m, x\, Xj, . ■ ■ , x n ) ; 
if («,„ ^ r) 
for (z = 1; i ^ r; i = i + 1) 



Vj = X r 



r+n m -t+ 



1 " m; 



else { 

for (z = 1 ; i ' ^ n m ; i = i + 1 ) 



Vj = x 



r+n 



for (z = n m + 1 ; i ^ r; i = i + 1) 



V; = X 



n+n„ 



- (m - 1) ;} 
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The function max (x\, x 2 , . . . , x n ) simply returns the 
maximum value of the cells x\, X 2l ■ ■ ■ , x n . The function 
find_repeat (m, X\,X 2 , ■ ■ -,x n ) returns the number of 
times the value m repeats in the cells x\,x 2 , ■ ■ . ,x n . If the 
value of n m is at least r then the buffer is stored between the 
(n m + l)-st and (n m + r)-th cells, and the buffer values are 
calculated by subtracting m from the value of each cell. If the 
value of n m is less than r then the buffer is stored cyclically 
in two cell groups: the last r — n m cells and the n m cells in 
locations r + 1, . . . , r + n m . In the first group, the buffer values 
are given by subtracting m — 1 from the cells' value and in 
the second group by subtracting m from the cells' value. 
Encoding map The input to this map is a cell-state 

vector x = [x\,x 2 , ■ ■ -,x n ), and a new bit b. Its output is 
either a cell-state vector y = (yj, y 2 , . . . ,y n ) or the erasure 
symbol E. 



On the first write, according to the bit value b, the first or 
the (r + l)-st cell changes its value to one. On the first write 
on each layer, the first n — r + 1 cells are increased to level 
m, and then the first or the (r + l)-st cell is increased by one 
level, according to the bit value b. For all other writes, if the 
value if b is one then we simply increase the (r + n m + l)-st 
cell by one level, and otherwise we increase the first cell of 
level m — 1 by one level. Finally, if it is one of the first r — 1 
writes in each level, then we need to update the last cell that 
stores the buffer to level m — 1 since it no longer stores the 
buffer and thus its level has to be updated. 

Next, we prove the correctness of the construction. 

Lemma 7. After s = x(n — r) + y, where 1 ^ y ^ n — r, the 
maximum cell level is x + 1 and there are y cells in level x + 1. 

Proof: According to the encoding map £\, u f, the maximum 
cell level increases every n — r writes, on the (i(n — r) + 
l)-st write, for ^ i ^ q — 2. Therefore, after s writes, 
the maximum cell value is x = \zr-p] ■ If y = 1 then the 
maximum cell value is x + 1 and we can see that exactly 



one cell changes its value to x + For all other writes, the 
maximum cell value does not change and exactly one cell 
changes its value to the maximum cell value which is x + 1. 

■ 

Theorem 8. The buffer code C(X ) 5 U f, Sbuf) stores the buffer 
successfully and guarantees t = (q — l)(n — r) writes. 

Proof: According to Lemma [7] after t = (q — l)(n — r) 
writes the maximum cell level does not reach level q and hence 
there is no need to erase the block of cells. We prove the 
correctness of the encoding and decoding maps to store the 
correct value of the buffer by induction on the number of 
writes s. This is done by proving that for all 1 ^ s ^ t, 
such that s = x(n — r) + y, where 1 ^ y ^ n — r, the 
buffer (v\, . . . , v r ) is calculated successfully according to the 
decoding rules of the decoding map: 

1) If y ^ r then for 1 ^ i ^ r, v\ = x r+ y_i + \ — m. 

2) If y < r then for 1 ^ i ^ y, z\ = x r+ y_i + \ — m and 
for y + 1 ^ i < r, v { = x r+y _ i+1 - (m - 1). 

It is straightforward to verify that after the first write the 
memory successfully stores the buffer. Assume the assertion 
is correct after the s-th write, where 1 ^ s = x(n — r) + y ^ 
t — 1, 1 ^ y ^ n — r. Assume that the new bit to be written 
to the buffer on the (s + l)-st write is b and let us consider 
the following cases: 

1) If y = n — r, then on the (s + l)-st write in the encoding 
map the value of n m is n — r. Thus the first n — r + 1 
cells change their value to m = x, the values of the last 
r — 1 cells do not change, and if b = 1 then y r +\ = m + 
1, and otherwise y\ — m + 1. Therefore, the new value 
of the buffer is also given according to the decoding 
rules. 

2) If y < n — r, then n m = y < n — r, and the value of 
the (r + n m + l)-st cell increases by b so the buffer is 
shifted one place to the right and it stores its updated 
value. If b — 0, then we increase the first n m + 1 cells 
by one level. Note that n m = y and there are exactly y 
cells with the maximum value so we can always find a 
cell of value less than m and increase the value to m. 
Then, the buffer is again stored according to the above 
decoding rules. 



VIII. Conclusion 

Rewriting codes for flash memories are important as they 
can increase the lifetime of the memory. Examples of such 
codes are flash codes (9) and buffer codes |fl~). A signif- 
icant contribution in this paper is an efficient construction 
of flash codes that support the storage of any number of 
bits. We show that the write deficiency order of the code is 
0(klogk - max{log 2 k, q}), which is an improvement upon 
the write deficiency order of the equivalent constructions 
in iflOl . ifTTI . Ifl9l . The upper bound in |9| on the guaranteed 
number of writes implies that the order of the lower bound on 
the deficiency is 0(kq). Therefore, there is a gap, which we 
believe can be reduced, between the write deficiency orders 



(y\,yz,...,y n ) = (x\, x 2 , . . . , x n ) ; 
m=max(x 1 ,x 2 , ■ . ■ ,x n ) ; 
n m = f ind_repeat (m, x\, x 2 , ■ ■ ■ ,x„) ; 
if (m == 0) { // if this is the first write 

±f(b == 1) y r+x = 1; 

else yi = l; } 
if (n m == n — r) { // first write in this layer 

for (i = l; i^n-r+1; i = i + 1) 
Vi = m; 

if (b == 1) y r+1 = m + l; 

else yi = m + 1; } 
if (n m < n - r) { // not the first write in this layer 

yr+n m +\ = Vr+rim+l + b; 

±f(b == 0) 
for (i = l; i ^ n m + r; i = i + 1) 
if (y, == m - 1) { 

Vr+n m +\ = y r +n m +l + U break; } } 
if (n m ^ r - 1 ) // one of first r — 1 writes in this layer 
Vn-r+l+nm = m-l; 



of our construction and the lower bound. For buffer codes, we 
showed how to improve an upper bound on the number of 
writes in the case where one cell is used to store the buffer. 
If there are multiple cells, we showed a construction that 
improves upon the one presented in fT|, iflOl . 
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